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Abstract. Assessing and monitoring vegetation as a grazing resource is 
often an integrated part of environmental, geographical and ecological re- 
search. This research paper explores the results and limitations of employ- 
ing a supervised Maximum Likelihood Classifier (MLC) and an unsuper- 
vised classifying approach to identify vegetation structure in a diverse sa- 
vanna biome. The two classification approaches are described, evaluated, 
and compared with in-situ field recordings and photographic field evidence. 
From this study it can be concluded that the use of medium resolution mul- 
tispectral imagery like the 10m SPOT 5 for pixel based classification of 
vegetation structure i n the study area is limited in its application value. The 
suitability of MLC and a hierarchical unsupervised classification is subject 
to the scale and accuracy requirements of the project. While uncertainties 
are present in all levels of the classification process, the inclusion of values 
from seasonal vegetation indices like the Normalized Difference Vegetation 
Index in a supervised classification may improve the accuracy of the result. 
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1. Introduction 

Background 

A common challenge faced within wildlife management and ecological stud- 
ies is the establishment of cost effective methods for vegetation classifica- 
tion, particularly in regions with high spatial variation in vegetation type 
and structure. Even when broad scale vegetation information for a study 
area i s avai I abl e, the spati al and temporal characteri sti cs of the data sources 
are often restrictive and not appropriate for the particular application need- 
ed. 



I n recent years the use of satellite imagery became a focal point of numer- 
ous vegetation related studies. As reported by Tucker (1979), vegetation 
indices for vegetation classification derived from Landsat imagery was al- 
readyin use during the early 1970's. Si nee then, the applicability of remote- 
ly sensed data for vegetation studies has been reported widely in the litera- 
ture (eg. Nagler, et al., 2002, Zhang, et al., 2003 & Kawamura, et al. 2004) 
and a myriad of sensors and remote sensing methods developed in quick 
succession. Various factors impact on the cost and expediency of a particu- 
lar solution and will influence its suitability towards a specific purpose. I n 
southern Africa data with a high temporal frequency, high spatial resolution 
or high spectral resolution are presently too expensive to be widely used as 
part of a geographic or environmental analysis. 

Study Area and Motivation 

Since 2003, the establishment of transfrontier conservation areas in south- 
ern Africa (Figure 1) facilitates the roaming of wildlife across international 
borders. 




Figure L Southern African Transfrontier Parks. 



Among South-African wildlife, the African Buffalo is known to be one of the 
main species responsible for the maintenance and potential spread of dis- 



eases such as Bovine tuberculosis (BTB) and Foot and Mouth disease 
(Garine-Wichatitsky et al. 2010). Concern exists about the spread and con- 
trol of these wildlife related diseases which may generate substantial eco- 
nomic losses for the livestock sector of beef producing regions (J ori et al. 
2009). Recent research suggests an association with the dispersion of path- 
ogens and the movement of buffalo (and cattle) across international bor- 
ders (Garine-Wichatitsky et al. 2010). Amongst several biotic and abiotic 
factors which affect the movement dynamics of animals, avail able forage is 
frequently noted as one of the main drivers (Baile et al. 2010, Bar-David et 
al. 2009, Winnieet.al. 2008). To deal with the concerns mentioned above, a 
project aiming to study the trans- boundary movements of buffalo within 
the Greater Limpopo Transfrontier Conservation Area (GLTFCA) was 
launched by CI RAD 1 in 2010. 

Due to the lack of detailed vegetation information in the study area an up- 
dated descri pti on of vegetati on structure wi 1 1 be a f i rst step towards a better 
understanding of current animal movement patterns. The vegetation classi- 
fication for this research is applied to a total area of 87.243km 2 which coin- 
cides with movement data associated with three buffalo and seven cattle 
herds tracked along the far northern boundary of the Kruger National Park 
(KNP), often referred to as the Pafuri area and an extended application area 
along its northern and western borders. 



Although the KNP has been the focus of scientific studies for decades, the 
Pafuri land system is remote with limited road access in some areas. Ac- 
cording to the most recent national vegetation map for South Africa (Mu- 
cina & Rutherford, 2006) the entire area falls within a savanna biome, but 
the zone is complex and diverse in natural characteristics and includes the 
Makuleke Wetlands, a system of inland pans within the Limpopo and 
Luvhuvhu floodplains which has been proclaimed an official Ramsar site in 
2007 ( www.ramsar.org ) . 

The Limpopo river valley and other areas where fence lines have been re- 
moved to aid natural movement of game between all sectors of the Great 
Limpopo Transfrontier Park, facilitates probable contact between buffalo 
and livestock. 



1 CI RAD is a French research centre working with developing countries to tackle interna- 
tional agricultural and development issues 



Problem statement 



The fact that savanna vegetation is essentially a double layered system with 
a canopy formed by varyi ng densities of trees and shrub and a lower layer of 
sub- strata consisting of grass and herbaceous vegetation - also in varying 
densities, results in a very complex problem when classifying vegetation 
with remotely sensors that essentially measures what is perceived from 
above(Hill etal.2011). The applicability of image data for vegetation struc- 
ture analysis is restricted by the scale of the pixel footprint, the size of the 
study area, the accuracy requirements of the project and the availability of 
imagery with appropriate spatial and temporal resolution. 

Research Objective 

Because researchers i n South Africa have reasonable access to SPOT i mage- 
ry, the pri nci pi e obj ecti ve i s to eval uate the potenti al of usi ng SPOT 5 mul ti - 
spectral imagery with a 10m spatial resolution to categorize the vegetation 
structure. An additional aim is to eval uate the usefulness of field survey as a 
ground truth validation tool within this research context. 



2. Materials and Methods 

2.1. Imagery 

Three SPOT5 images at a IB processing level with suitable acquisition dates 
were obtained from the National Space Agency of South Africa (SANSA). 
I mage processing software was used to co- register the images to an earlier 
2.5 m panchromatic geo-corrected SPOT image. Due to the natural status of 
the area and the lack of discernible man-made features, co- registration 
proved challenging but was achieved with a high level of accuracy and cor- 
relation between two images, one coinciding with the end- of- growing- 
season (30/04/201]) and the other typical of the end-of-dry season 
(12/08/2011). Subsets representing the study area are used for the classifi- 
cation procedures investigated in this paper. Radiance and reflectance val- 
ues of the pixels in all the images were derived to facilitate the stacking of 
bands from more than oneimage during classification. 

2.2. Classification of Vegetation Structure 

Within a multi -dimensional continuum of physical data I ike vegetation cov- 
er, any classification is essentially an abstraction (Edwards, 1983). The 
choice of the number and type of vegetation structural classes is in part de- 
pendent on the required product but also dependant on what can reasona- 
bly be derived from the imagery. Available vegetation classification results 



specific to savanna regions in South African conservation areas are essen- 
tially once-off studies depending heavily on extensive fieldwork, precise 
sampling methods and expert botanical knowledge (Gertenbach 1983, Van 
Rooyen 1978). The requirements of this project dictate a more robust pro- 
cess that is repeatable and adaptable to work already in motion in other 
parts of the GLTFCA. Four short field visits by a geographically inclined 
researcher did not warrant a floristically based approach or even very pre- 
cise field measurements. The study is therefore mostly reliant on the SPOT5 
image classification supported bythein-situ field recordings. 

2.3. In-situ structural classification 

Vegetation structure is described as the combination of the horizontal dis- 
tribution and vertical characteristics of dominant plants in an area (Hnati- 
uket.al 2009). For the field- based observations in the Pafuri study region, a 
very practical approach developed and described by Edwards (1983) is ap- 
plied. Edwards acknowledged the need for a stable classification scheme 
that may be used in non-plant specific research disciplines and refers to his 
structural classification as "purely complementary to" and "independant of" 
floristic and other forms of vegetation classification. 

With the introduction of a two-way matrix depending on structural groups 
and formation classes, the method that Edwards offers provides a practical 
and hierarchical structural procedure using estimations based on growth 
form, cover and height - aided by information about the substrata. In a 
simplified description, the observable "growth forms" are represented by 
trees, shrubs, grasses and herbaceous plant forms whilst "cover" refers to 
the vertical projection of the plant onto the ground. The cover of the upper 
growth form stratum is fundamental to the definition of class irrespective of 
height. Height is then added as an ordinal measure adapted to each growth 
form. (Figure2) 




Figure 2. Example of ordinal measures applied to the "tree" growth form 
(Edwards, 1983). 



An adapted "Edwards" classification sheet were used during four seasonally 
based field surveys. A convenience sampling method was adopted as each 
site had to be accessi bl e from a road. Several factors restri cted the f i el d pro- 
cedure that could be followed by a single researcher in this remote location. 
Fieldwork in theKNP is constrained by park regulations; field visits had to 
be planned in advance and could not be altered if conditions were not suit- 
able. As a result only 24 of the 33 control sites could be included in this 
study. 

To evaluate the researcher's ability to estimate projected ground cover of 
trees in the field, a supervised classification using recent (2008) aerial pho- 
tography is used to map only trees, sub-strata, and shade in a rectangle 
broadly presenting each field site where trees are present (Figure 3). Class 
stati sti cs i 1 1 ustrate the percentage cover. F i el d based canopy esti mates i n al I 
cases correlated well with the results obtained from the aerial photo classi- 
fication. 







Trees (48H) 




Shade (12K) 




Sub-strata nrs nil 



Figure 3. Example of tree canopy and class statistics calculation for one site. 



2.4. Class selection for image analysis 

As it was anti ci pated that the 10m spati al resol uti on of the SPOT 5 data wi 1 1 
not be suitable to make clear distinctions between all of the Edwards vege- 
tation classes, several classes were consolidated into seven vegetation clas- 
ses, as well as a Bare Soil and a Water class (Table 1). This was done in col- 
laboration with other researchers from CI RAD working in Zimbabwe and 
Mozambique. 



Consolidated 
Classes 


Corresponding Edwards classes and characteristics thereof 

(Forbes/Herb cover is treated the same as grassland classes) 


Riverine Forest 


Forest: 75-1 00% tree cover up to 20+ m. Shrub <1 0% if > 1 m high 


Woodland 


Closed Woodland: 10-75% tree cover up to 20+ m. Shrub <10% if > 1m high 


Open woodland 


Open Woodland: 1-10% tree cover up to 20+ m. Shrub <10% if > 1 m high 
Sparse Woodland: 0.1-1% tree cover up to 20+ m. Shrub <10% if > 1m high 


Bushland 


Thicket & Bushland: 1-100% tree cover up to 10 m. Shrub 10-100% and > 1m high 
Closed Shrubland: 10-100% shrub cover up to 5m high 


Open bushland 


Open Shrubland: 1 -1 0% shrub cover up to 5m high 


Grassland 


Closed Grassland: 10-100% grass cover up to 2m+ high 
Open Grassland: 1 -1 0% grass cover up to 2m+ high 


Sparse 


Sparse Shrubland: 0.1-1% shrub cover up to 5m high 



vegetation cover 


Sparse Grassland: 1-10% grass cover up to 2m+ high 


Bare soil 


Not applicable 


Water 


Not applicable 



Table L Summary of class consolidations. 



2.5. Image analysis 

Two pixel based image classification approaches are investigated in this 
paper. Firstly the widely used supervised Maximum Likelihood Classifier 
(MLC) is applied using a set of image specific training areas referred to as 
Regions of Interest (ROIs). With this classifier each pixel is consistently 
allocated to the class with which it has the highest calculated probability of 
class membership (Adams & Gillespie 2006). Thirty ROIs for each class, 
created by visually inspecting the images using a false colour display, are 
employed as training areas. To limit statistical distortion in the classifica- 
tion results all vegetation ROIs contain a similar number of pixels. Con- 
sistency when selecting similar ROIs is achieved by keeping a record of typ- 
ical reflectance values and spectral plots for each region. 

A f i rst i nvesti gati ve cl assif i cati on of the August i mage showed severe conf u- 
sion between classes. A similar classification for the April image demon- 
strated less confusion but still clear misrepresentation with regards to 
shade and water in deep river valleys, and also between iron wood forests, 
shade and other woodland areas. This was addressed by adding two more 
training regions for the deep river valley and the spectrally distinct iron 
wood forests respectively. 

Due to the differences in temporal changes between various plant species 
and in a bit to improve the application value of the classification, the Nor- 
malized Difference Vegetation I ndex (NDVI ), often referred to as a "green- 
ness" index was calculated for all values in both images. As explained in 
Beck et al. (2007), vegetation indices are valuable for investigating tem- 
poral vegetation dynamics. Work by Van Bommel et al. published in 2006 
demonstrates the capability of NDVI-based analysis techniques within a 
spatially and temporally varied landscape by integrating physiognomy and 
NDVI in a "nested NDVI-based classification" of forage distribution in the 
Okavango Delta, Botswana. 

To accommodate NDVI values in the classification all available bands from 
the two images and the derived NDVI bands are stacked and the "separabil- 
ity" of the updated eleven ROIs calculated. Derived pair separability of the 
chosen ROIs illustrates potential challenges for the supervised classification 
method. Band segregation probability for the eleven chosen ROIs is shown 
by displaying the mean spectral profiles for each class in each band in a 
composite graph (Figure4). 
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Figure 4: Band segregation probability as derived from spectral profiles using the 
mean values. 

From the graph it is clear that the spectral profiles of several classes are 
very similar it is also evident that the two NDVI bands may be useful in 
separating some of the challenging areas. New classifications including dif- 
ferent combinations of stacked bands, illustrated visually better results. To 
reduce fragmentation, post classification procedures are used to interpret 
theinitial product and achieve a more thematically summarized image. The 
two water classes and the two woodland classes are combined followed by 
sieving and clumping functions and the application of a median filter. 

For the second classification method an unsupervised approach isfollowed. 
Unsupervised classifiers (K-means and Isodata) measures and locates clus- 
ters in the data space, but an analyst is required to identify these clusters 
(Adams & Gillespie 2006). The unsupervised classification process used 
here incorporates a succession of unsupervised classifications on the same 
data set in the foil owing hierarchical format: A large number of classes are 
created allowing up to 99 iterations. Clear identifiable classes are deter- 
mi ned and a mask created for the rest. Repeat the above process for the 
remaining pixels each time using the new mask created until all remaining 
pixels are classified. 

Validation 

Though it is acknowledged that the in-situ field data from 23 sites is not 
statistically enough for comprehensive quantitative validation testing a 
qualitative evaluation is provided comparing results of different classifiers 



with the estimated field classes as derived from the Edwards (1983) classifi- 
cation (Table 2 ). Results from the MLC classifier consistently out-perform 
that of the unsupervi sed cl assi f i cati on . 



Field 
Control 
Site 


In-situ classifi- 
cation 


Supervised Maximum Likelihood Classifier 
(MLC) 


Unsupervised 
Classifier 


Structural class 


f laccifiratinn 1 

V> IQ99IIIWQUUM X 

Bands: April image 
and April & August 

NDVI values 


C laccifiratirtn 9 

V IQ99II 11 CI LIU II £ 

Bands: April & 
August images and 

NDVI values 


Daccifiratinn 3 
viaaaiiiiciLiuii j 

Results from 

Classification 2 

after nost classifi- 
es 1 LC 1 UuJl LICIjjMI 

cation 


Classification 4 

(Unsupervised) 


1 


Open Woodland 


□ 


□ 


■ 




2 


Woodland 


□ 


□ 


□ 


1 


3 


Bushland 


■ 


□ 


■ 




4 


Onen Woodland 


- 


- 


- 




5 


Onen Bushland 

\J [J *w 1 1 U UJl MUMU 


- 


- 


- 


1 


6 


Woodland 


■ 


□ 


■ 




7 


Open Woodland 


□ 


□ 


- 


r— i 


8 


Open Bushland 


- 


- 


- 




9 


Onen Woodland 


□ 


□ 


□ 




10 


Sparse vegetation 
cover 


□ 


□ 


■ 




11 


Sparse vegetation 
cover 


□ 


□ 


■ 




12 


Open Bushland 


□ 








13 


Grass 


□ 


□ 


□ 




14 


Open Woodland 


□ 


□ 


□ 




15 


Bushland 


□ 


□ 


■ 


□ 


16 


Open Woodland 


□ 


□ 


□ 




17 


Riverine Forest 


■ 


■ 


■ 


■ 


18 


Open Woodland 










19 


Open Bushland 


■ 


■ 


■ 


■ 


20 


Bushland 


□ 


□ 


■ 


□ 


21 


Open Bushland 


■ 


□ 


■ 


□ 


22 


Sparse vegetation 
cover 


□ 


□ 


□ 




23 


Open Woodland 


■ 


□ 




□ 


24 


Riverine Forest 


□ 


□ 


■ 


■ 


SUMMARY 

■ = Perfect correlation 
□ = Partial correlation 
- = No correlation 


■ =6(25%) 
□ = 14 (58.3%) 
- =4(16.6%) 


■ =2(8.3%) 
□ = 17 (70.8%) 
- =5(20.8%) 


■ =11(45.8%) 
□ =6(25%) 
- =7(29.2%) 


■ =5(20.8%) 
□ =5(20.8%) 
- =14(58.3%) 



Table 2: Summarized comparison of field classification versus various image clas- 
sification results from different band combinations. 



For a quantitative assessment of Classification 3 in Table 2 another set of 
30 validation points for each class were created from Google Earth and Aer- 
ial photography by the skilled botanist Dr Pierre Poilecot, formerly from 
CI RAD. A confusion matrix derived from the result achieved an overall ac- 
curacy of 65% with a Kappa Coeffi ci ent of 0.6. 



3. Discussion 

All classification results illustrate considerable confusion between the more 
open vegetation classes, eg. Open Bushland, Open Woodland and the 
Sparse Vegetati on Cover cl ass. CI ass results from the conf usi on matri x indi- 
cate very high accuracy levels (above 90%) for the Riverine Forest, Water 
and Bare Soil classes. A qualitative assessment of the points used for valida- 
tion revealed that using point data for validation may result in misrepresen- 
tation as the position of the point may fall just inside or outside of a class. 
However, using a buffer region around the original validation points did not 
significantly improve the overall accuracy level. Furthermore it is clear that 
uncertainty and fuzziness is incorporated in all levels of the classification 
process, from the choice of vegetation classes, ROI selection (MLC) or class 
identification (unsupervised classifier) to the selection of ground truth vali- 
dation data. Sub- pixel classification or the incorporation of ancillary data at 
suitable scales may assist in improving accuracy levels while integration of 
"fuzzy" parameters as described by (Welikanna et al, 2012), (Dwivedi et al. 
2012) may i mprove the ability to scientifically assess and report uncertain- 
ties within classification results. 



4. Conclusion 

From this work it can be concluded that the use of medium resolution mul- 
tispectral imagery like 10m SPOT 5 for pixel based classification of vegeta- 
tion structure in a diverse savanna biomeis limited in its application value. 
The suitability of MLC and a hierarchical unsupervised classification will 
heavily depend on the scale requirements of the project. Future work 
should investigate ways to increase the ability to estimate and describe the 
level of uncertainty associated with each classification method in order to 
improve the application value of the result and enable functional thematic 
mapping of vegetation structure. It is hoped that the issues, limitations and 
results explained in this work will be of value to other researchers in fields 
of environmental analysis and management. 
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